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Abstract 

We prove sharp upper bounds on the entropy numbers of the p- 

sphere in in the case k > d and 0 < p < g < oo. In particular, we close a gap 
left open in recent work of the second author, T. Ullrich and J. Vybiral. We also 
investigate generalizations to spheres of general finite-dimensional quasi-Banach 
spaces. 


1 Introduction 

Entropy numbers are a central concept in approximation theory. They quantify the 
compactness of a given set with respect to some reference space. For K being a subset of 
a hnite-dimensional (quasi-)Banach space Y, the fc-th (dyadic) entropy number ek{K, Y) 
is dehned as 

2^-1 

ek{K, Y) = min |e > 0 : H C fj (@ + some Xi,..., X 2 fe-i ^ | ; (1) 

i=i 

where By denotes the closed unit ball in the space Y. A detailed discussion and historical 
remarks can be found in the monographs DEI- 

Recall the (quasi-)norms ||x||p := (y)j=i for 0 < p < cx3 and ||x||oo := 

max(L;^ |xi|. The entropy numbers 6^(5^,^) of the closed unit balls = {x G : 
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Il^^llp < 1} in (-q = II • llq) for 0 < p, g < cx) are well-understood for more than a 
decade now UllZli. For the reader’s convenience, we restate the result. Recall that 
/ < g means that there is a constant C > 0 such that / < Cg] the notation f ^ g 
means that g ^ f and f ^ g. 

Lemma 1. Let 0 < p < q < oo and let k and d he natural numbers. Then, 

1 : 1 < A; < log((i), 

y: log{d)<k<d, 

2-kld^llq-llp . ^ _ 

The constants behind "x” do neither depend on k nor on d. They only depend on the 
parameters p and q. 

Regarding p-spheres = {x G : ||x||p = 1}, one would expect a similar behavior 
of the entropy numbers ek{Sp~^,i^). A rigorous proof, however, has been provided just 
recently in [B]. Let us restate this result, as well. 

Lemma 2. Let dEf^,d>2,0<p<q<oo and p = min{l,p}. Then, 




(^) 

2 -fc/(d-i)^i/g-i/p < fork>d. 


(ii) 


efc(Sp 


d-l 



1 

log(l+d/fc) ^ 


for 1 <k < logd, 
for \ogd < k < d. 


The proof largely mimics the corresponding proof for p-unit balls, including the 
volume arguments which are utilized in the case k > d. Once p < 1 however, the volume 
arguments get too coarse for the upper bound. They do no longer lead to the correct 
order of decay; instead of we only see 

In this work, we present an alternative proof for the case k > d which gives the 
correct order of decay l/{d—l). In essence, the proof relies on a scheme to construct a 
covering of from a covering of Bp~^. In Section [2l we prove 

Theorem 3. Let dGlN, (i>2, 0<p<g<cx3. Then, 

X fork>d. 

Finally in Section [21 we generalize the methods presented in Section [2] to study 
entropy numbers efc(Sx,R) for general hnite-dimensional quasi-Banach spaces X and 

F. 
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Notations For e G { — 1, l}'^ a sign vector, we denote the associated orthant by Qe ■ = 
{x G : 0 < signejXj < l}- Furthermore, by Hi we denote the hyperplane where the 
Fth coordinate of every vector equals 0 and Cj = {x G R^ : |xj| < \xj\ for j = 1,..., d}. 
Whenever we write |x| < \y\, then this is meant coordinate-wise: \xj\ < \yj\ for each 
j = 1,..., d. 

2 Entropy numbers of ^^-spheres 

Let us start this section with a simple proof that the entropy numbers of the p-sphere 
are always bounded from below by the respective entropy numbers of the p-unit 
ball Bp~^. The known behaviour of the latter immediately implies all the lower bounds 
in Lemma [2] and Theorem 121 

Theorem 4. Let 0 < p,q < oo. For all natural numbers k, we have 

Proof. Let £ = and choose an associated covering 

2k-l 

sp'c 

i=i 

Let Pi be the projection in R*^ onto the hyperplane Hi setting the hrst coordinate to 
zero. Now, identify and Bg~^ with its natural isometric embedding into Hi. Then 

and Pi(S^-') = 

and the linearity of Pi imply 

‘2k —1 2 ^ — 1 

Bp' = -P,(Sp‘) c U (Pix, + ePi(B^)) = U (Pixj + ESyi). 

j=i i=i 

By dehnition of the entropy numbers this gives ek{Bp~^, < e and proves the theo¬ 
rem. □ 

The upper bounds in (ii) of Lemma [2] directly follow from the corresponding upper 
bounds for B^ and the inclusion C B^. We turn to the proof of the upper bound 
on the entropy numbers of for k > d in Theorem [3l The covering construction is 
based on the bijective shifting map 

B^nQenH.^s^p-^nQenQ, 

X I—)■ X -f s(x)e, 

which shifts x by 0 < s(x) < 1 along the diagonal e until it hits the p-sphere. Since 
the vectors in the domain of all have the i-th coordinate equal to 0, the shift s(x) 
corresponds to the Pth coordinate of A^(x). Below we provide bounds on the difference 
between two shifts s(x), s{y) that depend on the £oo-distance of x and y. 
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Lemma 5. Let x,y E O Qe H Hi such that |a;| < \y\. Then 

s{y) < s(x) < s(y) + ||a; - y\\oo. 


Proof. We give the argument for p < oo, the proof is easily adapted for p = cxd. By 
symmetry, it is enough to check the claim for the positive orthant associated with e = 
(1,..., 1) and i = d. Then Xd = yd = 0 and the assumption |a;| < \y\ translates into 
0 < Xj < yj for j = 1,... ,d — 1. Moreover, a = s{x) > 0 and r = s{y) > 0 are given as 
the unique nonnegative solutions of the equations 


d-l 

E( 

i=i 


d-l 


Xi 


aP + = 1 


and 


= 1 . 


t=i 


Now T < a directly follows from 0 < Xj < yj for j = 1,... ,d — 1. The remaining 
inequality a < ||a; — i/||oo + 'risa consequence of 


d-l 

Y^{xj + af + a^ = 
i=i 


< 


< 


d-l 


i=i 


d-l 


(xj + ijjj - Xj) + rf + H 

i=i 


d-l 

- y\\oo + rY + 
i=i 


{\\x-y Woo Pry. 


□ 


Now it is easy to establish the following for A^: 

Lemma 6. For any x,y E O Qe C) Hi, we have 

ll^e(a^) - ^eiy)Woc < 2||a; - yWoo- 

Proof. For the moment, assume that |a;| < \y\. The previous lemma and |a;| < \y\ 
immediately give 


-Ik - pIIoo < \xj\ - \yj\ + s{x) - s{y) < |k - yWoo 

for every j = 1,... ,d. Hence, we have ||— A^(i/)||oo < |k — yWoo- 

Now for arbitrary x,y E B^ H Qe H Hi, let 2 : G B^ O Qe H Hi he the vector given 
by Zj = signcj min(kj|, Ipjl) for j = l,...,d. But for this vector, both \z\ < k| 
and \z\ < \y\ hold true. Hence, a simple application of the triangle inequality yields 

Theorem 7. Let 0 < p,q < 00 and d >2. For k > d it holds true that 


4 



Proof. We first treat the case q = oo. Let Afe,i{s) be a minimal e-covering of BpOQeHHi 
in Since HQeH Hi is a subset of a (rf — 1)-dimensional £p-ball, we have 

|A4,(<r)| 

Lemma E] implies that the set A^(A/"e,i(£^)) is a 2e-covering of Sp^^PlQeLlC,. Consequently, 

V(2e) := U U 

eG{-l,lp l<i<d 


is a 2e-covering of Moreover 

|A^(2£)|< 5; 5; |V.,(£)| 

eG{-l,l}‘* l<i<d 

By dehnition of entropy numbers, 

Hence the assertion follows for k > 2d + [logd] immediately from the upper bound in 
the third case of Lemma [TJ For d < k < 2d+ [log d], we have by monotonicity of entropy 
numbers and Lemma El (ii) that efc(Sp“\f'^) < ed(Sp“^,£^) < d~^/P; now, due to the 
assumption A; < 2d -|- [logd] we may estimate 2“® < and hence the assertion 

holds true for all k > d. 

The case of general g is a consequence of the factorization property of entropy num¬ 
bers, see for instance [H Section 1.3], 

< I|id : et ^ 


□ 


Remark 8. For p = q, the Mazur map 

Mp : 83"^ ^ X ^ ((signa:j)|a:jf/P)[Li 

provides a different technique to derive matching bounds for p < 1 in Lemma El This 
mapping forms a homeomorphism between the ^ 2 - and the £p-sphere. In particular, for 
0 < p < 2 the Mazur map is Lipschitz with constant L = L{p) independent of the 
dimension, see [9]. Hence, any e-covering of the 2-sphere yields an Le-covering of the 
p-sphere by mapping the centers to with the Mazur map. This shows that 

ek{^i-\ip)<LekiSt\i2), 

which allows to transfer the upper bounds for the case p = 2 in Lemma El to the case 

0 < p < 1. 
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3 Entropy numbers of spheres in general quasi-Ba¬ 
nach spaces 

In this section, we elaborate on the methods presented in the previous section, general¬ 
izing them to arbitrary hnite-dimensional quasi-Banach spaces. To this end, let X be 
equipped with some quasi-norm || ■ ||. Let Sx be the unit sphere in X. Let X* be 
the hyperplane Hi considered as a subspace of X, i.e. its unit ball is Bx H Hi. Let X* 
be the hyperplane Hi with the quasi-norm whose unit ball is the image of Bx under 
the orthogonal projection of R'^ onto Hi. Observe that the quasi-norms of Xj and X* 
coincide for X = f'p, 0 < p < cxo. That Xj and X* are the same is always true if the 
orthogonal projection onto Hi is a norm 1 projection, which is the case, in particular, if 
X is a Banach space. 

The following theorem provides a generalization of Theorem HJ 

Theorem 9. Let X and Y he R'^ with quasi-norms as above. For all natural numbers 
k, we have 

efc(Sx,R) > max efc(RxpL'*). 

l<i<d 

Proof. The proof of TheoremlUcan be easily adapted to this situation. Let e = efc(Sx, Y) 
and choose an associated covering 

2k-l 

Sx c IJ {xj + eBy). 

i=i 

Let Pi be the orthogonal projection in R'^ onto the hyperplane Hi setting the hrst 
coordinate to zero. Then we have 

-Pi(-By) = Byi and Pj(Sx) = Bxi- 

Since Pi is linear this implies 

Qk—l 

Bxi = Pii^x) c (PjXj -f ePyi) . 

i=i i=i 

By dehnition of the entropy numbers this gives efc(Pxq ^ ^ cind proves the theorem. 

□ 

To generalize the upper bounds, we have to assume a monotonicity property of the 
quasi-norm of X in each orthant Qe. Let us call X monotone if for any e G { — 1,1}'^ 
and for x,y & Qe with |x| < \y\ we have ||a;|| < ||?/||. Again, we dehne a bijective shifting 
map by 

Af : Px n Qe n Pi ^ Sx n Qe n Q, 

X X + s{x)e, 

which shifts x by 0 < s(a;) <1 along the diagonal e until it hits the sphere of X. Then 
analogues of Lemmas [5] and [6] hold true. 
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Lemma 10. Let x,y E X D Qe H Hi such that |x| < \y\. Then 

s{y) < s(x) < s(y) + ||x - y\\oo. 

Lemma 11. For any x,y E X Hi, we have 

ll^f (a^) - ^eiy)\\oo < 2||x - i/lloo- 

We leave the modifications of the proofs to the attentive reader. Now we obtain the 
following generalization of Theorem [71 

Theorem 12. Let X and Y be with quasi-norms as above and assume that X is 
monotone. Let k> d — flogd]. Then 

ek{^x,Y) < 2 ||Jd : F ^ £^|| max ek-d-iiosd]{Bx„iV)- 

l<i<d 

Proof. At first, we treat the case Y = Let J\fe^i{e) be a minimal e-covering of 
Bx n Qe n iLj in Since Bx H Hi is a. subset of a Bxi, we have 

where denotes the covering number, that is the smallest possible number 

of balls eB^ necessary to cover Bxi- Lemma fTTl implies that the set A^(A/’e,i(£)) is a 
2e-covering of Sx H Qe H Cj. Consequently, 

M(2e) := U U 

ee{-l,l}'^ l<i<d 


is a 2e-covering of Sx- Moreover 

d 

|V(2£)|< Y1 E 

eG{-l,l}d l<i<d i=l 

By definition of entropy numbers. 


eki^xJio) < 2 max ek-d-\iogd-]{Bxi, 

l<i<d 

The case of general F is a consequence of the factorization property of entropy 
numbers: 

ei=(Sx,y) < l|id: C ^ r|| et(S.v.C‘)- 

□ 

To conclude this note, let us discuss the situation that both X and F are Banach 
spaces with norms being symmetric with respect to the canonical basis. Then it is 
possible to make the bounds in the Theorems |9] and [12] concrete by applying the results 
of Schiitt [7]. We call a Banach space X with norm || ■ || symmetric if the canoncial 
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basis {ei,..., e^} has the following property. For all permutations vr, all sings e*, and all 
Xi G R, we have 


i=l 




d 

xjCj 

i=l 


Below we use the notation 


k 

^E{k) = II 

i=\ 

where E denotes a d-dimensional, symmetric Banach space and {ci,..., e^} its canonical 
basis. 


Corollary 13. Let X and Y he symmetric Banach spaces and assume the canonical 
basis in both spaces to be normalized. Then, we have 


for k < d — 1 and 


ek{^y\Y) > 


1 Ay(£) 

— max -—— 
2e i=k,...,d-i \x[t) 


1 2-fc/(d-i) Ay(d - 1) 

e Xx{d — l) 

for k >2d+ [log d] — 1. 


< ek{Sy\Y) < 32c 


Xyi^d — 1 ) 
Xx{.d — 1 ) 


Proof. Since X is symmetric, so is X*. Moreover, X* and Xj are isometrically isomorphic 
for i j hj the symmetry of X. The same holds true for Y. Hence, ek{Bxi,Yi) = 
ek{Bxj, Yj) for i ^ j and all k. By virtue of [71 Theorem 5 (1)] we know that 


ek{Bxi, Yi) > 


1 Ay^(£) 

— max -—. 
2e £=k,...,d-i XxAi) 


Combining with Theorem [3 and noting that = Ax(f') as well as AviW - Ay(£) 

for £ = 1,..., d — 1, we obtain the stated lower bound for < d — 1. 

Regarding the bounds for fc > 2d -|- [logd] — 1, we get from [71 Theorem 5 (2)] that 

1 2 -fc/C-i) < e,{Bx,,Y,) for A; > d - 1 (2) 

e Axi (d - 1) 

as well as 

em{Bx,,it") < c 2—for m > d - 1. (3) 

Equation ([2]) and Theorem [9] immediately give the lower bound. For the upper bound, 
put m = k — d — [logd]. Noting that ||id : Y —)■ = Ay(d) < 2Ay(d — 1) and 

Xgd-i{d — 1) = 1, Theorem [12] combined with ([3]) then gives 


efc(S^-\ Y) < 4c 2 ('^+n°g'^l)/C-b ^ for fc > 2d + [fogd] - 1. 

Xx{d — 1) 


Since we have assumed that d > 2, we can estimate < 8. □ 














Remark 14. In contrast to Theorem [71 the statement of Corollary [13] cannot be ex¬ 
tended to fc > d in a simple way as we do not know how ed(S^^, Y) behaves. 

Example 15 (Lorentz norms). For x G let x* = (x*,...,a:*) denote the non¬ 
increasing rearrangement of x. We consider generalized Lorentz norms 

i=l 

for 0 < g < oo and a non-increasing function w : [1, oo) —)• (0, cx)) with ic(l) = 1, 
limt^oo= 0 and = oo- Then 


^w,q{k') 


k 



{twnrY^ 


Particularly, for the choice w{t) = for 0<g<p<oowe obtain the standard 

Lorentz norm 


X 


w^q 







2=1 


A simple calculation, where one approximates the sum by an integral, shows that 
Ap,g(/c) X So, if we consider X = and Y = if, we obtain from Corollary 


[T3] that 




2-k/(d-l) 




for k>2d+ [log d] — 1. 


Example 16 (Orlicz norms). A convex function M : R+ —)■ R+ with M(0) and M{t) > 0 
for t 7 ^ 0 is called an Orlicz function, which we associate with the norm || • ||m given by 


d 

x\\m ■= inf{p > 0 : ^ M(|xi|/p) < 1}. 
2=1 


It is easy to calculate that 


Xuik) 




1 

M-i(l/fc)’ 


where M~^ is the inverse function of M. Let us consider some specihc examples. In the 
following we always choose Y = if for some 0 < g < oo. 

(i) Let M{t) = exp(—1/t^) for 0 < t < 1/2. See [3] for an application where the 
associated Orlicz norm appears naturally. We have M~^{t) = \/— Int and consequently 


efc(Sl7\£j) X \/h/d d^/^ 
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(ii) The following Orlicz function is taken from [5]. For p > 1 and a > 0, let 
M on [0,to) be given by M{t) = t^’ln(l/t)". Here, to = l/exp(/5 + — 7) with 

{3 = a{2p — 1)/(j5^ — p) and 7 = (a^ — «)/— p)- There is a tg < to such that 

( \ i/p 

Hence, for k sufficiently large, we find 

efc(S^-\£j) X log(d)"/P 
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